MULTI-MODAL UTILE DISTINCTIONS Multi-Modal Utile Distinctions

نویسندگان

  • William Dabney
  • Amy McGovern
چکیده

We introduce Multi-Modal Utility Trees (MMU), an algorithm for autonomously learning decision treebased state abstractions in Partially Observable Markov Decision Processes with multi-modal observations. MMU builds the trees using the Kolmogorov-Smirnov statistical test. Additionally, MMU incorporates the ability to perform online tree restructuring, enabling it to build and maintain a compact state approximation. It then uses reinforcement learning over these abstractions to learn a control policy. We empirically evaluate the performance of MMU on the Tsume-Go subtask of Go, and show knowledge transfer from the single step Tsume-Go task to the multi-step Tsume-Go task. In the single step task MMU outperforms the best existing decision tree based methods and performs competitively with a Support Vector Machine method for solving Tsume-Go problems. We are unaware of existing work on multi-step Tsume-Go, but MMU reaches over 40% accuracy on subsequent gameplay steps. A random agent would achieve less than 9%, and thus there is a significant amount of knowledge transfer. The use of multi-modal observations allows us to use a variety of observation representations which are combined to form the abstractions learned by MMU. This provides flexibility in the choice of observation representation which is lacking in existing methods for the online learning of state abstractions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State

We present Utile Suffix Memory, a reinforcement learning algorithm that uses short-term memory to overcome the state aliasing that results from hidden state. By combining the advantages of previous work in instance-based (or “memorybased”) learning and previous work with statistical tests for separating noise from task structure, the method learns quickly, creates only as much memory as needed ...

متن کامل

Michael Kremer Marti on Descriptions in Carnap ’ S S 2

This note is a friendly amendment to Marti’s analysis of the failure of Føllesdal’s argument that modal distinctions collapse in Carnap’s logic S2. Føllesdal’s argument turns on the treatment of descriptions. Marti considers how modal descriptions, which Carnap banned, might be handled; she adopts an approach which blocks Føllesdal’s argument, but requires a separate treatment of non-modal desc...

متن کامل

Semantically Enriched Models for Modal Sense Classification

Modal verbs have different interpretations depending on their context. Previous approaches to modal sense classification achieve relatively high performance using shallow lexical and syntactic features. In this work we uncover the difficulty of particular modal sense distinctions by eliminating both distributional bias and sparsity of existing small-scale annotated corpora used in prior work. W...

متن کامل

Roles, Rigidity, and Quantification in Epistemic Logic

Epistemic modal predicate logic raises conceptual problems not faced in the case of alethic modal predicate logic: Frege’s “Hesperus-Phosphorus” problem— how to make sense of ascribing to agents ignorance of necessarily true identity statements—and the related “Hintikka-Kripke” problem—how to set up a logical system combining epistemic and alethic modalities, as well as others problems, such as...

متن کامل

Modal Ontology and Generalized Quantifiers

Timothy Williamson has argued that in the debate on modal ontology, the familiar distinction between actualism and possibilism should be replaced by a distinction between positions he calls contingentism and necessitism. He has also argued in favor of necessitism, using results on quantified modal logic with plurally interpreted second-order quantifiers showing that necessitists can draw distin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010